A Composite Method Based on Formal Grammar and DNA Structural Features in Detecting Human Polymerase II Promoter Region
نویسندگان
چکیده
An important step in understanding gene regulation is to identify the promoter regions where the transcription factor binding takes place. Predicting a promoter region de novo has been a theoretical goal for many researchers for a long time. There exists a number of in silico methods to predict the promoter region de novo but most of these methods are still suffering from various shortcomings, a major one being the selection of appropriate features of promoter region distinguishing them from non-promoters. In this communication, we have proposed a new composite method that predicts promoter sequences based on the interrelationship between structural profiles of DNA and primary sequence elements of the promoter regions. We have shown that a Context Free Grammar (CFG) can formalize the relationships between different primary sequence features and by utilizing the CFG, we demonstrate that an efficient parser can be constructed for extracting these relationships from DNA sequences to distinguish the true promoter sequences from non-promoter sequences. Along with CFG, we have extracted the structural features of the promoter region to improve upon the efficiency of our prediction system. Extensive experiments performed on different datasets reveals that our method is effective in predicting promoter sequences on a genome-wide scale and performs satisfactorily as compared to other promoter prediction techniques.
منابع مشابه
E-cadherin Promoter Methylation Comparison and Correlation with the Pathological Features of the Squamous Cell Carcinoma of Esophagus in the High Risk Region
E-cadherin is among tumor suppressor genes which mostly subjects to the down-regulation in squamous cell carcinoma of esophagus (SCCE). The gene is tightly associated with the tumor invasion and metastasis in multiple human cancers, especially SCCE. CpG islands’ methylation in the promoter region of E-cadherin is among the mechanisms that have been suggested for the E-cadherin silencing, howeve...
متن کاملHypermethylation of E-Cadherin and Estro-gen Receptor- Gene Promoter and Its Association with Clinicopathological Features of Breast Cancer in Iranian Patients
Background: Aberrant methylation of cytosine-guanine dinucleotide islands leads to inactivation of tumor suppressor genes in breast cancer. Tumor suppressor genes are unmethylated in normal tissue and often become hypermethylated during tumor formation, leading to gene silencing. We investigated the association between E-cadherin (CDH1) and estrogen receptor-α (ESRα) gene promoter methylation a...
متن کاملPolymorphisms in Promoter Region of the Interferon - Gamma Receptor1 Gene and its Relation with Susceptibility to Brucellosis
Background & Objective: Brucellosis is one of the most prevalent bacterial zoonotic diseases which afflicts both humans and animals. Genetic factors play an important role in susceptibility to brucellosis. One of these factors is interferon-gamma (IFN-g), which is vital in the defense mechanism against infectious diseases such as brucellosis. The purpose of this study was to ev...
متن کاملStudy of promoter CpG island hypermethylation of cyclindependent kinase inhibitor gene p21waf1/cip1 on some breast carcinoma cell lines
The p21 belongs to the CIP/KIP family of CDK inhibitors involved in cell cycle arrest at specific stages of the cell cycle progression. DNA methylation is the best studied epigenetic mark that have been evidently associated to chromatin condensation, and repression of gene transcription. The CpG island hypermethylation in promoter region of certain genes occurs in cancer cells and affects tumor...
متن کاملStructural basis of transcription: an RNA polymerase II-TFIIB cocrystal at 4.5 Angstroms.
The structure of the general transcription factor IIB (TFIIB) in a complex with RNA polymerase II reveals three features crucial for transcription initiation: an N-terminal zinc ribbon domain of TFIIB that contacts the "dock" domain of the polymerase, near the path of RNA exit from a transcribing enzyme; a "finger" domain of TFIIB that is inserted into the polymerase active center; and a C-term...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 8 شماره
صفحات -
تاریخ انتشار 2013